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Abstract 



We consider collocated wireless sensor networks, where each node's transmissions can be heard by 
every other node. Each node has a Boolean measurement and the goal of the wireless sensor network 
is to compute a given Boolean function of these measurements. We first consider the worst case 
setting and study optimal block computation strategies for computing symmetric Boolean functions. 
We study three classes of functions: threshold functions, delta functions and interval functions. We 
provide exactly optimal strategies for the first two classes, and a scaling law order-optimal strategy with 
optimal preconstant for interval functions. We also extend the results to the case of integer measurements 
and certain integer-valued functions. We use lower bounds from communication complexity theory, and 
provide an achievable scheme using information theoretic tools. 

Next, we consider the case where nodes measurements are random and drawn from independent 
Bernoulli distributions. We address the problem of optimal function computation so as to minimize 
the expected total number of bits that are transmitted. In the case of computing a single instance of a 
Boolean threshold function, the problem reduces to one of determining the optimal order in which the 
nodes should transmit. We show the surprising result that the optimal order of transmissions depends in 
an extremely simple way on the values of previously transmitted bits, and the ordering of the marginal 
probabilities of the Boolean variables, according to the k-th least likely rule: At any transmission, the 
node that transmits is the one that has the A:-th least likely value of its Boolean variable, where k reduces 
by one each time any node transmits a one. Initially the value of A: is (« + 1 - Threshold). Surprisingly, 
the order of transmissions does not depend on the exact values of the probabilities of the Boolean 
variables, but only depends on their order 

The approach presented can be generalized to the case where each node has a block of measure- 
ments, though the resulting problem is somewhat harder, and we conjecture the optimal strategy. In 
the case of identically distributed measurements, we further show that the average-case complexity of 
block computation of a Boolean threshold function is 0{9), where 9 is the threshold. We further show 
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how to generalize to a pulse model of communication. One can also consider the related problem of 
approximate computation given a fixed number of bits. In this case, the optimal strategy is significantly 
different, and lacks an elegant characterization. However, for the special case of the parity function, we 
show that the greedy strategy is optimal. 

I. INTRODUCTION 

Wireless sensor networks are composed of nodes with limited power and bandwidth, which can 
sense the environment and wirelessly communicate with each other to complete a collaborative 
task. However, in contrast to wireless data networks, most wireless sensor network applications 
are not ininterested only in computing some relevant function of the correlated data at distributed 
sensors. For instance, one might want to compute the mean temperature for environmental 
monitoring, or the maximum temperature in fire alarm systems. In order to extract the relevant 
information from enormous quantities of data generated by sensor nodes, one needs to design 
scalable algorithms and protocols. Thus, one needs to look beyond the data-forwarding paradigm 
and study how to design efficient in-network computation and communication strategies for 
functions of interest. 

The general problem of distributed function computation in wireless sensor networks presents 
several challenges. The broadcast nature of the wireless medium means that nodes have to 
deal with interference from other transmissions. This is typically resolved by a mechanism for 
distributed scheduling of transmissions. One of the consequences is that nodes which transmit 
later can exploit both previously received transmissions as well as the structure of the function 
to be computed, in order to create a more efficient description of their own data. Further, it is 
of interest to study the benefit of multi-round protocols, possibly involving complex interactions 
between nodes, versus single round protocols, where each node transmits only once. In the 
case where nodes have random data drawn from different distributions, there is the additional 
question regarding which node should transmit, since different nodes affect the function to 
different degrees. 

In this paper, we consider the collocated network scenario where all nodes can hear all 
transmissions. Its symmetry makes it a desirable starting point for studying random planar 



networks. At most one node is allowed to transmit at any time. Each node has a Boolean 
variable and we focus on the specific problem of symmetric Boolean function computation. We 
will focus on strategies which achieve function computation with zero error for all sensor nodes. 
This is motivated by applications in distributed control and sensor-actuator networks, where each 
node needs to infer the state of the system in distributed fashion and respond in real time. We 
adopt a deterministic formulation of the problem of function computation, requiring zero error. 
We consider both the worst case setting and the average case setting where we impose a joint 
probability distribution on the node measurements. 

In Section Unl we consider the worst case setting, and address the problem of computing 
symmetric Boolean functions, which depend only on number of Is, i.e., the "type,". We study 
block computation strategies where nodes accummulate a block of measurements and employ 
block codes to achiever greater efficiency. The set of admissible strategies includes all interactive 
strategies, where a node may exchange several messages with other nodes. Since nodes can only 
transmit one at a time, the key challenge is for nodes to thoroughly exploit previous transmissions 
to compress their own data. We suppose that each node has a Boolean measurement and we 
wish to compute a given symmetric Boolean function of these measurements with zero error. We 
define three classes of functions, namely threshold functions which evaluate to 1 if the number 
of Is exceeds a certain threshold, delta functions which evaluate to 1 if the number of Is is 
exactly equal to a given value, and interval functions which evaluate to 1 if the number of Is 
is between two given lower and upper values. For worst-case computation, we provide exactly 
optimal strategies for the first two classes, and a scaling law order-optimal strategy with optimal 
preconstant, as the number of nodes increases, for interval functions. Using these results, we can 
characterize the complexity of computing percentile type functions, which are of great interest. 

In our analysis, we use lower bounds from communication complexity theory, and provide 
an achievable scheme using information theoretic tools. Further, the approach presented can be 
generalized to compute functions of non-Boolean measurements, as shown in our treatment of 
general threshold functions and the MAX function. While the proposed achievability strategy is 
exactly optimal for general threshold functions, it is only scaling law order-optimal for the MAX 



function. 

In Section |IVl we address the case where the node measurements are independent and dis- 
tributed according to given marginal Bernoulli distributions. Since the measurements are random, 
the evolution of the computation depends on the particular instances of measurements, and 
the time of termination of the computation is also accordingly random. We seek to minimize 
the total expected number of bits exchanged to achieve zero error computation. We primarily 
focus on optimal strategies for Boolean threshold functions. In the case of single instance 
computation, this is equivalent to determining the optimal ordering of nodes' transmissions so 
as to minimize the expected total number of bits exchanged. We present a surprising policy for 
ordering the transmissions and prove its optimality. The optimal policy is dynamic, depending 
in a particularly simple way on the previously transmitted bits, and on the relative ordering 
of the marginal probabilities, but, surprisingly, not on their values. The problem of optimally 
ordering transmissions of nodes is a sequential decision problem and can in principle be solved 
by dynamic programming. However, this would require solving the dynamic program for all 
thresholds and all probability distributions, which appears infeasible. We avoid this, and directly 
establish the optimal policy. The proposed solution permits a unified treatment of the problems of 
single instance computation, block computation and computation under alternate communication 
models. 

In Section IFV-Bl we turn our attention to the case where each node has a block of bits, and 
we seek to compute the Boolean threshold function for each instance of the block. This problem 
appears formidable due to the plethora of possibilities, and due to a far more complex class of 
interactive strategies for computation. However, for a certain natural restricted class of coherent 
strategies, we can establish that an analogous policy is optimal, thus establishing an upper bound 
on the optimal cost. In order to establish a lower bound across all strategies, we propose the 
approach of calculating the minimum entropy over all valid protocol partitions which respect 
fooling set constraints. While this lower bound matches the upper bound for small examples, 
a proof has remained elusive. In Section IIV-B21 we show that the average case complexity of 
computing a Boolean threshold function is 0{Q), where d is the threshold. 



In Section |IV-C[ we consider an alternate model of communication where nodes use pulses 
of unit energy to convey information. We generalize our proof technique and derive the optimal 
strategy for computing Boolean threshold functions under this model of communication. Finally, 
in Section irV-D[ we study the problem of approximate function computation given a fixed number 
of timeslots. We show that the optimal strategy for the approximate computation of threshold 
functions lacks the same elegant structure as present in the case of exact computation. However, 
for the special case of the parity function, we show that the logical strategy of first querying the 
node with maximum uncertainty, i.e., entropy, is optimal. 

II. RELATED WORK 

The the problem of worst-case block function computation with zero error was formulated in 
[?]. The authors identified two classes of symmetric functions namely type-sensitive functions 
exemplified by Mean, Median and Mode, and type-threshold functions, exemplified by Maximum 
and Minimum. The maximum rates for computation of type- sensitive and type-threshold func- 
tions in random planar networks were shown to be ©(j^) and ©( ipoioo;? ) respectively, where n 
is the number of nodes. If we impose a probability distribution on the node measurements, one 
can show that the average case complexity of computing type-threshold functions is 0(1) [?]. 

In this paper, we address the problem where every node wishes to determine the value of 
the function. This approach naturally allows the use of tools from communication complexity 
[?], where one seeks to find the minimum number of bits that must be exchanged in the worst 
case between two nodes, in order to achieve zero-error computation of a function of the node 
variables. The communication complexity of Boolean functions has been studied in [?], [?]. In 
order to increase the computational efficiency, one can consider the direct-sum problem [?] where 
several instances of the problem are solved simultaneously. This block computation approach 
results in matching upper and lower bounds in the case of the Boolean AND function [?]. In this 
paper, we considerably generalize this result to derive the worst case complexity of computing 
Boolean threshold functions in collocated networks 

If the measurements are drawn from some joint probability distribution and one is allowed 



block computation, we arrive at a distributed source coding problem with a fidelity criterion 
that is function-dependent, conceming which little is known. The problem of source coding 
with side information was studied in [?]. The extension of this approach to the problem of 
function computation with side information was studied in [?]. The problem of interactive 
function computation in collocated networks has been studied in [?]. 

Due to the broadcast nature of the wireless medium, two nodes which are close to each 
other cannot transmit simultaneously. Thus, nodes need to schedule their transmissions to avoid 
interfering with one another. The challenge now is to order nodes' transmissions so as to exploit 
the structure of the function, the side-information gained from previously transmitted bits, and 
the knowledge of the underlying distribution. Sequential decision making problems have been 
studied in various forms. The most well known problem of designing sequential experiments is 
the bandit problem [?], [?], [?]. One is given a bandit with multiple arms which offer different 
probabilistic payoffs. At each time-step, the player needs to choose which arm to play so as to 
maximize the expected long-term payoff. Thus, there is a tension between exploring which arms 
offer highest payoffs and exploiting them. Under the optimal strategy, each arm is assigned a 
dynamic allocation index and the arm with maximum index is chosen. 

In [?], an interesting problem in sequential decision making is studied, where, n nodes have 
i.i.d. measurements, and a central agent wishes to know the identities of the nodes with the k 
largest values. One is allowed questions of the type "Is X > f\ to which the central agent receives 
the list of all nodes which satisfy the condition. Under this framework, the optimal recursive 
strategy of querying the nodes is found. A key difference in our formulation of the sequential 
decision making problem is that we are only allowed to query particular nodes, and not all nodes 
at once. The problem of minimizing the depth of decision trees for Boolean threshold queries 
is considered in [?]. 

III. Worst Case Computation of Symmetric Boolean Functions 

In this section, we consider a collocated network with nodes 1 through n, where each node's 
transmissions can be heard by every other node. Thus, the problem of medium access is resolved 



by allowing at most one node to transmit successfully at any time. Each node / has a Boolean 
measurement Xi E {0, 1}, and every node wants to compute the same function /(Xi,X2, . . . ,X„) 
of the measurements. We seek to find communication schemes which achieve correct function 
computation at each node, with minimum worst-case total number of bits exchanged. We allow 
for the efficiencies of block computation, where each node i has a block of independent 
measurements, denoted by Xf . Throughout this section, we suppose that collisions do not convey 
information thus restricting ourselves to collision-free strategies as in [?]. This means that for 
the k'^ bit bk, the identity of the transmitting node depends only on previously broadcast 
bits Z?i,Z?2, • • . while the value of the bit it sends can depend arbitrarily on all previous 

broadcast bits as well as its block of measurements X^. 

It is important to note that all interactive strategies are subsumed within the class of collision- 
free strategies. A collision-free strategy is said to achieve correct block computation if each node 
i can correctly determine the value of the function block /^(Xi,X2, . . . using the sequence 
of bits Z?i,Z72,... and its own measurement block Xf . Let be the class of collision-free 
strategies for block length N which achieve zero-error block computation, and let C(f,SN,N) 
be the worst-case total number of bits exchanged under strategy Sn G The worst-case per- 
instance complexity of computing a function /(Xi,X2, . . . ,X„) is defined by 

C(f)= lim mm ^ -. 

We call this the broadcast computation complexity of the function /. 

We begin by recalling Theorem 1 in [?], which states that the complexity of computing the 
AND function of two variables is log2 3 bits. In Section IIII-Al we generalize this result to 
obtain the broadcast communication complexity of the AND function of n variables. In Section 
IIII-Bl we derive the broadcast computation complexity for the more general class of threshold 
functions. For this class of functions, we devise an achievable strategy which involves each node 
transmitting in turn, using a prefix-free codebook, and a lower bound based on fooling sets. It 
is interesting to note that the optimal strategy requires no back-and-forth interaction between 
nodes. In Sections [III-CI and IIII-DL we extend this approach to derive the broadcast computation 



complexity of computing delta functions and interval functions respectively. 

Finally, in Section ITlI-El we present some extensions to the case of non-Boolean measurements 
and to the case of non-Boolean functions. Using the intuition gained from the Boolean case, we 
show how the achievability scheme and fooling set lower bounds can be adapted. In particular 
we study general threshold functions and the MAX function. 

A. Complexity of computing the AND function 

We consider now the specific problem of computing the AND function, which is 1 if all its 
arguments are 1, and otherwise. Consider a collocated network with n nodes, each of which 
wants to compute the AND function of n variables, denoted A(Xi,X2, . . . For the case 
where n = 2, we know from Theorem 1 in [?] that the broadcast communication complexity of 
computing the AND function is log2 3 bits. We have the following result for general n. 

Theorem 1: For any strategy S^, 

C(ZiAX2...Z„,5^,^) >A^log2(n+l). 

Further, there exists a strategy 5^ such that 

C{Xx^X2...X„XN.N) < [A^log2(n + l)l+(n-2). 

Thus, the complexity of the multiple node AND function is given by C(A(Xi,Z2, . . = 
log2(n + l). 

Proof of Achievability: The upper bound is established using induction on the number of nodes 
n. From Theorem 1 in [?], the result is true for n = 2 which is the basis step. Suppose the result 
is true for a collocated network of (n—l) nodes. Consider an achievable scheme in which node 
n transmits first, using a prefix free codebook. Let the length of the codeword transmitted be 
l{Xl^). After this transmission, the function is determined for the instances where X„ = 0. For the 
instances where X„ = 1, the remaining {n — l) nodes need to compute /\{X\,X2, . . . From 
the induction hypothesis, we know that this can be done using \w{X^) log2 n~\+{n — 3) bits. Thus 
the worst-case total number of bits exchanged is L := maxxN{l{Xl^) + log2n + {n — 3)] ). 



As before, we want to minimise this quantity subject to the Kraft inequality. Consider a prefix- 
free codebook for node n which satisfies 

/(X,f ) = [A^log2(n + l)]+{n-2)- [w(X,f ) log^n] - (n - 3) 

This satisfies Kraft inequality since 

vN vN 

Proof of lower bound: The lower bound is shown by constructing a fooling set [?] of the 
appropriate size. We digress briefly to introduce the concept of fooling sets in the context of 
two-party communication complexity [?]. Consider two nodes X and 7, each of which take 
values in finite sets ^ and '3/', and both nodes want to compute some function f{X,Y) with 
zero error. 

Definition 1 (Fooling Set): A set £ C ^ x is said to be a fooling set, if for any two distinct 
elements {x\,y\), (^2,^2) in E, we have either 

• 7^/(^2,^2), or 

• =/(^2,y2), but either /(xi,3;2) ^ f{xi,yi) or /(x2,Ji) 7^/(^1,^1)- 

Given a fooling set E for a function f{X\,X2), we have C{f{X\,X2)) > log2|£'|. We have 
described two dimensional fooling sets above. The extension to multi-dimensional fooling sets 
is straightforward and gives a lower bound on the communication complexity of the function 

f{Xi,X2,...,Xn). 

We need to devise a subset of the set of all nx N measurement matrices which is a valid 
fooling set. Consider the subset E of measurement matrices which are only comprised of columns 
which sum to (n — l) or n. Since there are A'^ columns, there are (n + l)^ such matrices. Let Mi, 
M2 be two distinct matrices in this subset. If {M\) ^ {M2), then we are done. Suppose not. 
Then there must exist one instance where the function evaluates to zero and for which Mi and 
M2 have different columns. Let us suppose M\ has l,, — e,- and M2 has 1„ — ej. Now if we replace 
the fh row of M\ with the fh row of M2, the resulting measurement matrix, say M* is such 



that f{M*) 7^ /(Ml). Thus, the set £ is a valid fooling set. From the fooling set lower bound, 
we have, for any strategy Sn G S^n, we must have C{A{Xi,X2),Sf^,N) > A/^log23 implying that 
C{f) > log2 3. This concludes the proof of Theorem [T] □ 

By symmetry, we can derive the complexity of the OR function, which is defined to be if 
all its arguments are 0, and 1 otherwise. Consider a collocated network with n nodes, each of 
which wants to compute the OR function, denoted by V(Xi,X2, .. .,X„). 

Corollary 1: The complexity of the OR function is given by C(V(Xi,X2, . . . = log2(« + 
1), since we can view it as A(Xi,X2, . . . by deMorgan's laws. 

Note: Throughout the rest of this section, for ease of exposition, we will ignore the fact that 
terms like A^log2(n + 1) may not be integer. Since our achievability strategy involves each node 
transmitting exactly once, this will result in a maximum of one extra bit per node, and since we 
are amortizing this over a long block length A^, it will not affect any of the results. 

B. Complexity of computing Boolean threshold functions 

Definition 2 (Boolean threshold functions): A Boolean threshold function IIq{Xi,X2, . . . ,Xn) 
is defined as 



Theorem 2: The complexity of computing a Boolean threshold function is C{'n.e{Xi,X2, . . .X,,)) = 



Proof of Achievability: The upper bound is established by induction on n. From Theorem [Hand 
Corollary \T\ the result is true for n = 2 and for all \ < 6 < n, which is the basis step. Suppose 
the upper bound is true for a collocated network of {n — I) nodes, for all 1 < < (n — 1). Given 
a function IIq{Xi^X2t . . ,X„) of n variables, consider an achievable strategy in which node n 
transmits first, using a prefix free codeword of length 1{X^). After this transmission, nodes 1 
through n — l can decode the block Z,^. For the instances where X„ = 0, these (n — l) nodes now 
need to compute ne(Xi,X2, . . . For the instances where X„ = 1, the remaining (n—l) 

nodes need to compute IIq^i{Xi,X2, . . . From the induction hypothesis, we have optimal 





strategies for computing these functions. Let w'(X,f ) denote the number of instances of / in the 
block Xj^ . Under the above strategy, the worst-case total number of bits exchanged is 

L = m^x j /(X,f)+w«(X,f)log2 1^ ^ j +wi(X,f)log2 J ^ 

We want to minimise this quantity subject to the Kraft inequality. Consider a prefix-free codebook 
which satisfies 



/(X,f)=A^log2 



W«(X,f)l0g2 



wl(X,f)l0g2 



e 



0-1 



This assignment of codelengths satisfies the Kraft inequality since 



yN 




= 1. 



Hence there exists a prefix-free code which satisfies the specified codelengths, and we have 



L — N log2 I I , which proves the induction step. 

Proof of lower bound: We need to find a subset of the set of all n x N measurement ma- 
trices which is a valid fooling set. Consider the subset E of measurement matrices which 
consist of only columns which sum to (0 — 1) or 6. Since there are A'^ columns, there are 

( " \Y 

+ I such matrices. We claim that the set £ is a valid fooling set. Let Mi, 

ej \e-ij\ 

M2 be two distinct matrices in this subset. If f^{Mi) ^ {Mj), then we are done. Suppose 
not. Then there must exist at least one column at which M\ and A/2 disagree, say M^^ ^M^'' . 
However, both and M^^^ have the same number of ones. Thus there must exist some row, 
say z*, where M\f^ has a zero, but M^^rf^ has a one. 

(i) Suppose f{M['^) = f{M2^) = 0. Then, consider the matrix Ml obtained by replacing the 



/*th row of Ml with the i*th row of M2. The column of has ones, and hence 
f{M\^^^) = 1. Hence we have /(M*) 7^ /(Mi), 
(ii) Suppose /(m|'^) = f{M2^) = 1. Then, consider the matrix Mj obtained by replacing the 
/*th row of M2 with the i*th row of Ml. The f' column of M* has - 1 ones, and hence 
/(M*^'^) = 1. Hence we have /(M|) 7^/(M2). 

Thus, the set £ is a valid fooling set. From the fooling set lower bound, for any strat- 

( n + 1 

egy S]^ e we must have C{n0{Xi,X2,...,Xn),SN,N) > A^log2 | implying that 

V ^ 

c(n0(Xi,X2,...,x,O)>iog2 1 I. □ 



C. Complexity of Boolean delta functions 

Definition 3 (Boolean delta function): A Boolean delta function n|0i,(Xi,X2, . . . is de- 
fined as: 

' 1 ifM = 
otherwise. 



n{0}(^l,^2,---,^;j) 



Theorem 3: The complexity of computing n{0i,(Zi,X2, . . . is given by 



C(n{e}(Xi,X2,...,X„))=log2 



n+1 




+ 



+ 1 



Sketch of Proof: The proof of achievability follows from an inductive argument as before. The 
fooling set E consists of measurement matrices composed of only columns which sum up to 
— 1, or + 1. Thus the size of the fooling set is 

-1 A' 

n \ i ^ \ i ^ 
0-1 / \ / \ + 1 



D. Complexity of computing Boolean interval functions 
A Boolean interval function rij^ . . . is defined as: 



n[a,^,](^l:^2,---,^;3) 



1 if < Y^iXi < b 
otherwise. 



A naive strategy to compute the function Tl^^ ^{X\,. . . is to compute the threshold functions 
nfl(Xi, . . . and Ylh^\{X\,X2, . . . ,Xn). However, this strategy gives us more information than 
we seek, i.e., if L/^/ ^ [(^^bf'^ then we also know if < which is superfluous information 
and perhaps costly to obtain. Alternately, we can derive a strategy which explicitly deals with 
intervals, as against thresholds. This strategy has significantly lower complexity. 

Theorem 4: The complexity of computing a Boolean interval function n[Q^](Xi,Z2, . . . ,X„) 
with a + b <n is bounded as follows: 



log2 



n+1 
b+\ 



+ 



a — I 



<c(n[,,,](Xi,X2,...x,0) 



<l0g2 




. (1) 



The complexity of computing a Boolean interval function n[a^](Zi, . . . ,Z„) with a + b>n is 
bounded as follows: 



log2 



n+l 



a 



+ 



b+l 



<l0g2 



n+l 

' +{b-a+r 
a I \ b+l 



(2) 



Proof of lower bound: Suppose a + b < n. Consider the subset E of measurement matrices 
which consist of only columns which sum to {a — I), b or (b+l). We claim that the set £ is a 
valid fooling set. Let Mi, M2 be two distinct matrices in this subset. If f^{M\) ^ f^{M2), we 
are done. Suppose not. Then there must exist at least one column at which Mi and M2 disagree. 



say MpV^2^'^- 

(i) Suppose /(Mp^) =/(Mp^) = 1. Then, both M^^ and M^^ have exactly b Is. Thus there 
exists some row, say i*, where m\^^ has a 0, but ^ has a 1. Consider the matrix Mj 
obtained by replacing the i*th row of Mi with the /*th row of M2. The column of M*^ 
has 1) Is, and hence f{M\^^^) = 0, which means /(M*) 7^ /(Mi). 

(ii) Suppose f{M[^^) = /(M^^) = 0. If both m[^^ and M^^ have the same number of Is, then the 
same argument as in (i) applies. However, if has (a — 1) Is and Mj''^ has (^+1) Is, then 
there exists some row i* where m|"''' has a 0, but has a 1 . Then, the matrix M| obtained 



by replacing the i*th row of M2 with the /*th row of Mi is such that f{M^) ^ /(M2) 

-■ A' 



n \ I n \ I n 

b+1 / I a-1 



. This 



Thus, the set £ is a valid fooling set and l^l = 

gives us the required lower bound in ([T}. 

For the case where a + b>n, we consider the fooling set E' of matrices which are comprised 
of only columns which sum to a — I, a or b+ I. This gives us the lower bound in 
Proof of achievability: Consider the general strategy for achievability where node n transmits a 
prefix-free codeword of length 1{X^), leaving the remaining (n — l) nodes the task of computing 
a residual function. This approach yields a recursion for computing the complexity of interval 
functions. 



C(nr,,,i(Xi,...,X„))<l0g2 



2^(^(0- i,i-i](^iv A-1 )) _|_ 2C(n[a,&](^ivA-i) 



The boundary conditions for this recursion are obtained from the result for Boolean threshold 
functions in Theorem |2] We could simply solve this recursion computationally, but we want to 
study the behaviour of the complexity as we vary a, b and n. Define ha^i,„ := 2^^^^"'''^^^^'""'^"^K 
We have the following recursion for h{a,b,n) 

h{a,b,n) < h{a — l,b — l,n — I) + h{a,b,n — I). (3) 



We proceed by induction on n. From Theorems |2] and |3] the upper bounds in dD and <^ are 
true for n = 2 and all intervals [a,b]. Suppose the upper bound is true for all intervals [a,b] for 



(n—l) nodes. Consider the following cases. 

(i) Suppose a + b <n— I. Substituting the induction hypothesis in ([3]), we get 




(ii) Suppose a + b > n + \. Proof is similar to case (i). 

(iii) Suppose a + b = n. Substituting the induction hypothesis in dB]), we get 




n \ I n—\ 

+ 1 \+{b-a+l)\ 

a \ b+l 




< 



where some steps have been omitted in the proof of the last inequality. This establishes the 
induction step and completes the proof. □ 
1 ) Discussion of Theorem 

(a) The gap between the lower and upper bounds in ([T) and (|2]) is additive, and is upper bounded 
by log2(/? — a + 2) which is log2(n + 2) in the worst case. 

(b) For fixed a and b, as the number of nodes increases, we have a + b <n for large enough 



n. Consider the residual term, {b — a+l)\ | on the RHS in ([T])- We have 

a — I 



{b-a + 1] 



a — 1 



n + l 
b+l 



n + l 

Hence, C{Tl^a,b]{^i^ ■ ■ ■ ^^n)) = log2 | | | (H-o(l)) | . Thus, for any fixed interval 

[a,b], we have derived an order optimal strategy with optimal preconstant. The orderwise 
complexity of this strategy is the same as that of the threshold function Ilh^i{Xi, . . . ,X„). 
Similarly, we can derive order optimal strategies for computing C(n[„_Q . . . 

and C(n[^, „_^](Xi, . . . for fixed a and b. 

(c) Consider a percentile type function where [a,b\ = [an, fin], with (a + /3) < 1. Using Stir- 
ling's approximation, we can still show that 



{fi-a)n 



an — I 



n + l 
fin + l 



Thus we have derived an order optimal strategy with optimal preconstant for percentile 
functions. 

(d) Consider the function / := nu,.[a,.^fo;](Xi, . . . ,Xn) where the intervals [a/,Z?,] are disjoint, and 
may be fixed or percentile type. We can piece together the result for single intervals and 
show that 

C(/(Xi,...,X,0) =log2 [£g(a,-,Z7,-,n)(l+o(l)) j . 



where g{ai,bi,n) = < 



n + l 
b, + l 
n+l 
a,- 



if a; + bi <n 



if Qi + bi > n. 



E. Extension to general alphabets 

In Sections [III-BI - HIFDI we have studied optimal strategies for computing threshold functions, 
delta functions and interval functions of Boolean measurements. In this section, we will show 
that these results can be generalized to the case where nodes have general integer alphabets, i.e., 
X, G {0, 1 , . . . , m,}. The proofs are lengthier in this case, and to maintain clarity of presentation, 
we will focus on threshold functions and the MAX function. 

1 ) Complexity of General Threshold Functions: Consider a collocated network of n nodes, 
where node / has measurement X, G {0, 1, . . . ,m/}. 

Definition 4: A general threshold function TIq{X\,X2t . . ,Xn) is defined as below. 



n0(Xi,X2, . . .,X„) :-- 



1 iiL'Ux.>e 

otherwise 



We employ the same notation as for Boolean threshold functions, which constitute a special case 
of general threshold functions. 

Theorem 5: The complexity of computing Ile{Xi, . . . is given by 




e-i 



e-i 



n 1 y m,+ 1 

J=l / \k=l \ n-l 





where the notation [7^] refers to the coefficient of in the expression on the RHS. 
Proof: The proof proceeds by induction on the number of nodes n. From Theorem 1 in [?], 
we know that the result is true for n = 2 and all choices of m\,m2 and Q. This serves as a basis 
step for the induction. Let us suppose the result is true for a collocated network of n — 1 nodes 
and all choices of mi,m2, . . . and 9. We now proceed to prove the result for a network of 

n nodes. 

We specify a strategy 5^ in which node n transmits first. As described in [?], the optimal 
strategy consists of two stages, namely separation and coding. We begin by identifying the 
symbols in {0, 1 , . . . , m„} that need to be separated by node n. Let X„ be the mapping of X„ to 



the reduced alphabet given by {a^ ■ ■ ■ ^bn}. Subsequently, we construct a prefix-free codeword 
on the reduced alphabet. Let the length of the codeword transmitted be 1{X^). At the end of 
this transmission, the remaining n — 1 nodes need to compute a residual threshold function for 
each instance of the block. For example, if X„ = k, we are left with the task of computing 
He^kiXi, . . . ,X„_i). By the induction hypothesis, there is an achievable strategy to compute this 
residual threshold function, with complexity C{IlQ^k{Xi^ . . . ,Xn_i)). Thus the worst case total 
number of bits exchanged under this strategy is given by 



L := m_ax(/(l„^) + w"«(l„^)C(ne_,„(Zi, . . . + w"«+i(l„^)C(ne_«„_i(Xi, . . . 

+ . . . + (l„^)c(n0^,„ (Xi , . . . , x„_ 1 ) ) , 



where is the number of instances in the block where X„ = j. Our objective is to find the 

smallest L that satisfies the Kraft inequality for prefix free codes, which states that Y,xn 2^'^-^"^ < 
1 . From the definition of L, we can lower bound the LHS of the Kraft inequality. 

xl^ x^ 
From the induction hypothesis, we have that 



c(n0_fc(Zi,...,x„_i)) = iog2 



+ 



re-k-i 



'n-l 



(1 -y'"'+i; 



Thus, the smallest feasible value of L is given by 



= E 
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1 -Y 
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\-Y 



rB-b„-l 
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-ye 
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ye-i" 
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-ye-\- 



(i+F + ...+y"'")n 



1-y 

n— 1 / 2 y;ji/+l 



(4) 



n 

1=1 



1-y 



/=i 

N 



1-y 



A^log2 



+ 



n 

/=i 



2 — y^i+i 

1-y 



(5) 



where © follows from the fact that for k < a„ and k> bn, the coefficients of y^^^' and y^-^'-i 
are both zero. Thus, we have derived an upper bound on the complexity of computing general 
threshold functions in collocated networks. 

Proof of lower bound: We need to find a subset of the set of all « x A'^ measurement matrices 
which is a valid fooling set. Consider the subset E of measurement matrices which are made 
up only of the column vectors which sum to (0 — 1) or 9. Consider two distinct measurement 
matrices M\,M2 G E. Let {Mi) and f^{M2) be the block function values obtained from 
these two matrices. If f^{Mi) ^ f^{M2), we are done. Let us suppose /^(Mi) = /^(M2), 
and note that since Mi ^ M2, there must exist one column, say column j, where Mi and M2 
differ. However, since {Mi) = {M2), each column of Mi must sum to the same value as the 
corresponding column in M2. Thus there must exist rows ii and i2 such that Mi{iiJ) < M2{iiJ) 
and Mi{i2j) <M2{i2j). 

• If column j in Mi and M2 sum to — 1, then consider the new measurement matrix M* 



got by replacing the i'l' row of Mi with the i'l' row of M2. The column of M* sums to 

a value that is greater than — 1. Thus, we have f{M*) 7^ /(Mi). 
• If column j in Mi and M2 sum to 0, then consider the new measurement matrix M* got 

by replacing the i'2 row of Mi with the i'j row of M2. The f'^ column of M* sums to a 

value that is less than G. Thus, we have /(M*) ^ f{M\). 
Thus, the set £ is a valid fooling set. We now need to evaluate the size of E. The number of 
columns which sum to — 1 and 9 respectively, can be evaluated by looking at the coefficients 
at a carefully constructed generating polynomial given by 

{\+Y + ... + Y"''){l+Y + ... + Y"'^)...{\+Y + ... + Y"'"). 

This polynomial models all possible measurement vectors (Xi,X2, . . . ,X„). Thus, we can now 
calculate the size of E by looking at the coefficients of Y^ and y^^i in this polynomial. 



\E\ = 


y6 
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yd-l 




-y6 
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yd-X 




-y6 
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yd-X 



n(i+y + ...+r"': 



,1=1 



n 1 _ yra, + l 

^4 l-Y 



0(1-*"'""')) 1 1' ' 

,;=1 / \ A'=l \ n-l 



(6) 
(7) 

(8) 



where the last equation follows from the binomial expansion for negative exponents. Thus, we 
have established the required lower bound. □ 

2) The MAX function: In this section, we use the tools that we have developed to study a 
particular example, namely the MAX function. However, we no longer obtain exact results, which 
is to say that the single-round achievable scheme does not match the fooling set lower bound. This 
suggests that single round strategies are no longer optimal and it might be necessary to consider 
multi-round block computation strategies. Indeed, previous work in the area of communication 
complexity has shown a multi-round protocol that does better that our single-round scheme for 
the two node case. However, our proposed strategy is still exponentially better than the naive 



strategy of communicating all measurements. Further, it provides reasonably tight bounds and 
achieves the optimal scaling as the number of nodes increases. 

Consider nodes 1 through n organized in a collocated network as before. For simplicity, let 
us suppose that for each node i, G {0,1,..., m}. The MAX function of n measurements is 
defined in the natural way and is denoted by MAX,n{Xi,X2, . . . ,Xn). We want to determine the 
worst case complexity of computing the MAX function. 

Theorem 6: The complexity of the MAX function of n variables from the alphabet {0,1, ... ,m} 
is bounded as follows. 



Proof: We prove the result by induction on the number of nodes n. For the basis step, we 
consider the two node problem. Consider the general achievable scheme where node 1 sends 
a prefix free codeword of length 1{X^), and node 2 indicates its exact value for each of the 
instances of the block where Xi < X2. For example, if Xi = k, node 2 needs to indicate its 
value in the set {k,k + \ , . . . ,m}. Thus, the worst case total number of bits exchanged under this 
scheme is given by 



Proceeding as before, we can show that, in order to ensure a valid prefix free code with 
codelengths 1{X^) that satisfy Kraft inequality, the minimum L is given by 



For the lower bound, we can verify that the set of measurement matrices with columns exclusively 
from the set E given by 



log2(mn+ 1) < C{MAX„{Xi, . . . ,X„)) < log2 




L = max 



(Z(Xf^) )log2(m+ 1) +w'{X 



f)log2m + ...+w'"(Xi^)log2l). 




£ = {(0,0),(0,l),(l,0),...,(0,m),(m,0)}, 



is a valid fooling set. Thus we have 



log2(2m+ 1) < C{MAX,„{XuX2)) < logj 




which establishes the basis step for the induction. 

Now, let us suppose that the result is true for a network of {n — 1) nodes. Consider the 
following achievable scheme for the n node network. Node n transmits a prefix-free codeword 
of length l{Xl^). At the end of this transmission, the remaining (n — l) nodes need to compute 
the residual MAX function for each instance of the block. For example, if X„ = k, we are 
left with the task of computing the MAX function of (n—l) nodes on the reduced alphabet 
{k,k+ I, . . . ,n}. Since {k,k+ I, . . . ,n} is isomorphic to {0, 1, . . . ,n — /:}, this is equivalent to 
computing MAX„^k(Xi, . . . ,X„^i) . By the induction hypothesis, there is an achievable strategy 
to compute this residual MAX function, which we can unroll recursively. Thus the worst case 
total number of bits exchanged under this strategy is given by 



L = max (/(X,f ) + )C(MAX„(Xi, . . . + . . . + w'"(X„^)C(MAXo(Zi, . . . . 



which establishes the upper bound in the induction step. 

In order to prove the lower bound, we need to construct a fooling set of the appropriate size. 
Consider the set of nxN measurement matrices which consist of columns from the set E defined 



In order to satisfy the Kraft inequality, the smallest L that is feasible is given by 
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It is easy to check that this is a valid fooling set of size {mn+ 1)^ which gives us the required 
lower bound for the induction step. □ 

We make some observations regarding the result in Theorem |6] 

• For fixed m, we have that C{MAX,„{Xi,X2, . . . ,X,i)) = 0(log2n). This agrees with the result 
in [?] that the maximum rate of computing a type-threshold function is ©(j^^^)- Thus, the 
proposed achievable strategy is scaling law order-optimal. Further, we obtain better bounds 
on the complexity. 

(n + m \ 
I <min(nlog2(m-M),mlog2(n-M)). 

• The naive strategy for computing the MAX function consists of each node communicating 
its measurement which has a complexity of nlog2(m+ 1). For fixed m, the complexity of 
the proposed scheme is upper bounded by mlog2(n + 1), which is exponentially better than 
the naive strategy (0{\og2n) vs. 0(n)). 

IV. Average Case Computation of Symmetric Boolean Functions 

Consider a collocated network with nodes 1 through n, where each node i has a Boolean mea- 
surement Xi E {0, 1}. Xi is drawn from a Bernoulli distribution with P{Xi = 1) =: p,, and 
are independent of each other. Without loss of generality, we assume that pi < P2 < • • • < Pn- We 
address the following optimal distributed computation problem. Every node wants to compute the 
same function /(Xi,X2, . . . ,X„) of the measurements. Given a strategy for computing f{X\,X2, 
...,X„), the time of termination is a random variable. Our objective is to find communication 
strategies which achieve correct function computation at each node, with minimum expected 



total number of bits exchanged. 

In Section irV-A[ we formulate the problem of single instance computation of Boolean threshold 
functions. We identify a surprisingly simple policy and present a detailed proof of its optimality, 
by induction on the number of nodes in the network. In Section IIV-BI we consider the extension 
to the case of block computation of threshold functions, where each node has a block of 
measurements and we are allowed block coding. This problem is significantly harder, and we 
conjecture the structure of an optimal multi-round policy, building on the optimal policy for 
single instance computation. Further, we quantify the average case complexity of computing a 
Boolean threshold function in Section |IV-B2[ 

The extension of these results to an alternative model of communication, where binary in- 
formation can be encoded by the presence or absence of a pulse, is studied in Section IIV-CI 
When considering exact computation of functions of random data, it should be noted that the 
time of termination is a random variable. While the optimal strategy minimizes the expected 
time of termination, some instances of computation might terminate earlier and some much later. 
In Section IIV-DI we consider the problem of approximate function computation given a fixed 
number of timeslots. 

A. Single Instance Computation of Boolean Threshold Functions 

Let us suppose each node has a single Boolean measurement and we seek to compute a 
given Boolean threshold function. First, we note that since each node has exactly one bit of 
information, it is optimal to set = Xj^^. Indeed, for any other choice b'^. = g{bi,. . . ,Z?yt-i,^rj.), 
the remaining nodes can reconstruct b'j^ since they already know bi, . . . ,bk-\- Thus the only 
freedom available is in choosing the transmitting node as a function of b\,b2, ■ ■ ■ ,bk-\, for 
otherwise the transmission itself could be avoided. We call this the ordering problem. Thus, by 
definition, the order can dynamically depend on the previous broadcast bits. In this paper, we 
address the ordering problem for a class of Boolean functions, namely threshold functions. 

We will denote the set of measurements of nodes 1 through n by (Xi,X2, . . . which is 
abbreviated as X". We will use X'l- to denote the set of measurements (Xi, . . . . . . 



As a natural extension, we use X" to denote the set of measurements (Xi, . . . , . . . . . . ,X, 

where i < j. 

Definition 5 (Boolean threshold functions): A Boolean threshold function IIq{Xi,X2, . . . ,X„) 
is defined as 

[ 1 if LiXi > 0, 
I otherwise. 

The class of threshold functions has the property that, if one of the nodes' measurements is 
known, the residual function is still a threshold function. Given a function n„_^(X"), if node 
/ transmits its bit, we are left with the residual task of computing n„_;t-i(X'l,) if Xj = 1, and 
n„_;r,(X'l-) if Xi = 0. Thus, the ordering problem can be formulated as a dynamic programming 
problem. Let C(n„_y(-(X")) denote the minimum expected number of bits required to compute 
n„_;t(X"). The dynamic programming equation is 

C(n„_,(X")) = min{l +p,C(n„_fc„i(X"_,)) + (1 -;?,)c(n„_,(x".,))}- (9) 

i 

with boundary condition C(na(X"')) = if a = or a > m. 

To begin with, we argue that solving ^ for each n and k does indeed yield the optimal 
strategy for computing Boolean threshold functions. In particular, to derive the optimal strategy 
for computing Il„_k{X"), we first determine which node must transmit first, by solving ^ for 
n,k. Then, depending on whether Xj^j^ = or Xj-^i-j = 1, we are left with the residual task of 
computing n„_^(X"y,^jP or Il„_j^_i{X"_j,^^-^). We can determine which node should transmit 
next in either case, from the solution of (|9]) for n—\,k—\ ox n — l,k respectively. Proceeding 
recursively, one can unroll the optimal strategy for computing n„_^(Xi,X2, . . .X„). 

In (|9l), we recognise that the single-stage cost is uniformly 1 . More generally, given a function 
/(■) : [0, 1] —7- R+, one can write down a more general dynamic programming equation. 

c(n,_,(x")) = min{/(A-) + AC(n„_,_i(x'i,)) + (1 - A)c(n„_fc(x'ij)}. (lO) 

Here, one can view f{pi) as the cost of communicating the information of node / which has 
P{Xi = 1) = Pi. Indeed, for the case of single instance computation, we have f{p) = 1. In the 



sequel, we will see how this general dynamic programming formulation will allow us to study 
other problems of interest. 

For general /(■), solving the dynamic programing equation (fTOl ) may be intractable. Further, 
it is unclear at the outset if the optimal strategy will depend only on the ordering of the p/s, 
or their particular values. This makes the explicit solution of (flOl) . or even (|9]), for all n, k and 
(PiiPii - ■ -Pn) notoriously hard. However, under some conditions on /(■), we can derive a very 
simple characterization of the optimal strategy for each n and <k < n — I. Further, we observe 
that optimal strategy is independent of the particular values of the />,s, but only depends on their 
relative ordering. 

Lemma 1: Let /(■) : [0, 1] — R+ be a function such that 

• f{p)=ni-p). 

• is a monotone non-increasing function of p. 
Then the minimum in (flOl) is attained by k+\. That is, 

k+\ e argmin{/(p,) +p,C(n„_,_i(X'l,)) + (1 - AOC(n„_,(X'l,) } • (11) 

This result is true for all n and all < < n — 1 and all probability distributions with pi < P2 < 
...<Pn. 

Proof: We define the following expressions. 

Tm,k,i{^"^) = Pfc+iC(n,„_^_i(x'"^^^jP + (1 — pfc+i)c(n„,_yt(x'"(^^i)) 

-77,c(n,„_,_i(x-,) - (1 -77,oc(n,„_fc(x'!,) 

:= iPk+i -pi)C{n^.k-i{X"l^,^,,))) + {I - pkM^.n-k{X":^k+,))) 

-{i-Pi)cin„,^k{x"ii)). 

sl^l^iX"'):={p-Pk+i)C{n,„_k-i{X^^^^^^^ 

We establish the above theorem by induction on the number of nodes n. However, we need 



to load the induction hypothesis. Consider the following induction hypothesis. 

(a) r„,,^,,(X'") < f{p,) -f{pk+i) for all < < (m- 1), 1 < / < m 

(b) 41,(X") < {i-Pk+Ofipd-i^-Pd/iPk+O forall0<fc+l<(m-l),fc + 2</<m 

< Pk+if{pd~P<fiPk+i) forallO<fc<(m-l),l</<fc+l 

Observe that part (a) immediately establishes (fTT)) . 

The basis step for m = 1 is trivially true. Let us suppose the induction hypothesis is true for 
all m <n. We now proceed to prove the hypothesis for m = n + 1. 

Lemma 2: For fixed k and i>k + 2, we have 

Si'l,k,i^"^') < i^-Pk+i)f{pd - (1 -pdfiPk+i). 

Proof: See Appendix IA1[ 

Lemma 3: For fixed k and i < k, we have 

tf+i,Av(x"^') < Pk+inpi)-p.nPk+i). 

Proof: See Appendix IA2I 

Lemmas |2] and [3] establish the induction step for parts (b) and (c) of the induction hypothesis. 
We now proceed to show the induction step for part (a). 

Lemma 4: For fixed k and i>k + 2, we have 

Tn+l,kAX"^') < +Pk+inPi) -PifiPk+i). 

Proof: See Appendix IA3[ 

Lemma 5: For fixed k and i < k, we have 

Tn+lMX'^^') < ^?1,.,(X"+^) + {l-Pk+l)f{Pi) - (1 -Pi)f{Pk+l). 

Proof: See Appendix IA4[ 

Applying Lemmas |4] and |5] together with Lemmas |2] and |3l we see that yt , (X"+^) < for 



all <k <n and i ^ fc+ 1. For the case i = k+ 1, we have T{n+ l,k,k+ 1) = trivially. This 
completes the induction step for part (a), and the proof of the Theorem. □ 

Using Lemma [H we can now simply derive the optimal sequential communication strategy 
for computing a single instance of the Boolean threshold function n„_yt(X"). 

Theorem 7: In order to compute a single instance of the Boolean threshold function Il„_i^{X"), 
it is optimal for node (k+l) to transmit its bit first. 

Proof: In the case of single instance computation, we have f{p) = 1. Hence, trivially, we have 
that f{p) = f{l — p), and that ^^y^ is a monotone non-increasing function of p. From Lemma 
[H we have 

^+1 e argmin{/(77,) +;?,C(n„_,_i(X" ,)) + (1 - Pi)C{n„_k{X"_,)} . 

i 

Thus, in order to compute the Boolean threshold function n„_fc(X"), it is optimal for node k-\- 1 
to transmit first. □ 

Remark 1: At the outset, there are two heuristics that one may apply to the ordering problem. 
First, if we believe that Hn-kC^") evaluates to 0, the conditional optimal strategy is for nodes 
to transmit in order starting with node 1. Alternately, if we believe that n„_j(.(X") evaluates 
to 1, the conditional optimal strategy is for nodes to transmit in reverse order starting with 
node n. Thus, the result in Theorem |7] can be viewed as an appropriate hedging solution which 
safeguards against the event that n„_y(.(X") could evaluate to or 1. It is indeed surprising that 
a particularly simple hedging strategy is optimal for all n, all k and all probability distributions, 
and that it does not depend on the actual values of the probabilities but only on their order. 

B. Block Computation of Boolean Threshold Functions 

We now shift attention to the case where we allow nodes to accumulate a block of A'^ 
measurements, and thus achieve improved efficiency by using block codes. The most general 
class of interactive strategies are those where the identity of the node transmitting the k'''^ bit, 
say Tk can depend arbitrarily on all previously broadcast bits, and the bit itself can depend 
arbitrarily on all previously broadcast bits as well as Ty^'s block of measurements. We require 



that all nodes compute the function with zero error for the block, and wish to minimize the 
expected number of bits exchanged per instance of computation, denoted ^(n„_y(.(X")). While 
the problem of finding the optimal strategy in this general class of strategies appears intractable, 
we derive the optimal solution under a restricted class of strategies. The restriction we impose is 
natural, and we conjecture that the optimal strategy in this restricted class is also optimal among 
all interactive strategies. 

Define the following restricted class of coherent strategies. 

Definition 6: Coherent Strategies When computing n„_yt(X") for a block of A'^ measurements, 
a coherent strategy mandates that the first node to transmit, say node T'(l), must declare 
its entire block using a Huffman code. Note that this does not exclude interactive strategies, 
since, subsequent to node r(l)'s transmission, we have two subproblems over sub-blocks of 
measurements corresponding to instances where Xj^j^ = and Xj-^i) = 1. For each of these 
subproblems, we could potentially have different nodes transmitting first. Thus nodes may 
transmit more than once. However each of these nodes are again constrained to communicate 
their entire subblock of measurements. 

Theorem 8: In the restricted class of coherent strategies, in order to compute n„_;t(X") for a 
block of measurements, it is optimal for node fc + 1 to transmit its entire block first, using the 
Huffman code. This result is true for asymptotically long block lengths, for all n, all < < « — 1, 
and all probability distributions with p\ < pi < . . . < Pn- 

Proof: Let us suppose node i transmits first. Under a coherent strategy, node / must communicate 
its entire block, which requires an average description length of H{pi) bits per instance. This 
can be achieved asymptotically by using the Huffman code to compress node Ts block of 
measurements Q. Subsequent to node Ts transmission, we are left with the residual tasks of 
computing n„_yt-i(X'l,) for the subblock where Xt = 1, and Yln-kO^-j) for the subblock where 
Xi = 0. These are two block computation problems again. Let ^(/(n„_;t(X") denote the minimum 
number of bits per instance, that must be exchanged under this restricted class of strategies. We 

'For clarity of presentation, we will ignore the fact that the Huffman code for block length N has average codelength 
between \NH{p)\ and [NH{p)\ + 1 bits. The extra one bit can be amortized over long block lengths. 



can write a dynamic programming equation as before. 

^[/(n„_,(X")) = min{//(pO + A-^t/(n„_,_i(X'l,)) + (1 -pi)^u{nn_k{X"_d)}, (12) 

where H{p) is the standard binary entropy function defined by H{p) = —p\og2{p) — (1 — 
p) log2(l —p)- The boundary condition for ([10] ) is given by ^{/(na(X'")) = if a = or a > m. 

Observe that (fT2l) is a special case of (flOt where f{p) = H{p). Thus, for the class of coherent 
strategies, the problem of optimal computation once again reduces to an ordering problem. If we 
can show that H{p) satisfies the conditions in Lemma [B the result follows immediately. Clearly 
H{p) =H{1 — p) and one can verify that 

^[nr} _ log2(i-p) 

dp p^ ~ 

Thus, we have that is a non-decreasing function of p. Hence, from Lemma [H the optimal 
strategy for computing n„_yt(X") for a block of measurements is for node A;+ 1 to transmit its 
entire block first, using the Huffman code. □ 

Remark 2: The proposed optimal strategy is inherently interactive, since nodes do transmit 
more than once. This is due to the recursive splitting of the original block of measurements 
depending on nodes' transmissions. This is illustrated in the computation tree for n2(X-'), where 
node 2 first transmits its entire block using a Huffman code, and the computation proceeds as 
shown. In practice, all nodes need to agree a priori on a traversal order in the computation tree, 
e.g., depth-first traversal or breadth-first traversal. 

Remark 3: The proposed optimal strategy is asymptotically optimal in the limit of long blocks. 
This is necessary to achieve an average description length of H{pi) bits per instance. In practice, 
one could simply choose a large enough block length A'' so that the average description length is 
close enough to the entropy. In this context, it is important to note that, as the computation 
proceeds, the original block gets recursively subdivided into smaller and smaller subblocks 
of measurements. Each of these subblocks needs to be large enough to achieve an average 
description length that is close enough to the entropy of the transmitting node. Thus, in the 



worst case, we could have upto 2" subblocks in the computation tree, and we assume that each 
of these subblocks are large enough, which is ensured by choosing to be suitably large. 

1) A Strategy-independent Lower Bound: Next, we would like to determine if the class of 
coherent strategies considered above is rich enough to include the absolute optimal strategy for 
interactive block computation without any restrictions on a node encoding all its information 
using a Huffman code. Intuitively, since all the instances of the block are independent and 
identically distributed, it appears suboptimal for nodes to communicate only partial information 
regarding their blocks at any stage. Thus, it is plausible that, under the optimal strategy, one node 
communicates its entire block, and the computation proceeds recursively from there. However, 
establishing this fact rigorously is a formidable challenge. In this subsection, we describe a 
possible approach toward establishing this result, by adapting the concept of fooling sets. Fooling 
sets are a classical tool for establishing lower bounds in communication complexity [?], and have 
recently been used to establish tight lower bounds on the minimum number of bits exchanged 
in the worst-case in collocated networks [?], and tree networks [?]. We describe an extension 
of fooling sets to the probabilistic scenario and use this to establish a lower bound. 

We recall that, for the threshold function Iln_k{X"), a valid fooling set of maximum size is 
given by 

En^„^k:={X":t,Xi = n-kor f^X, = n-k-\} 

i=l 1=1 

Any correct protocol for distributed computation of n„^(X") partitions the function matrix into 
monochromatic rectangles [?]. Further, each rectangle in the partition can contain at most one 
element of E„ n_]^. Let D(n„_yt(X")) be the set of all protocol partitions of the function matrix of 
n„_^(X") which respect the fooling set constraints. Suppose we use a protocol with associated 
partition d, the number of bits that must be exchanged under this protocol is lower bounded 
by the entropy of this partition, denoted by H{p{d)), where p{d) is the implied probability 
distribution on the elements of the partition. Thus, the number of bits that must be exchanged 



under any protocol is bounded by 

^(n„_fc(X"))> min //(p(^i?))=:^i(n„_,(X")). (13) 

deD(n„_,(x«)) 

We conjecture that this lower bound is achievable and in particular, the optimal strategy described 
in Theorem [8] achieves it. 

Conjecture 1: The lower bound described in (fT3T ) satisfies the dynamic programming equation 
in ([HI). 

<^z.(n,_,(X")) = min{H{pi)+pi^Lin„_k-i{X"_,)) + (1 - pi)^Lin„_k{X"_,))}. 

Since ^L(n„_fc(X")) < ^(n„_fe(X'')) < ^{/(n„_^(X")), we conjecture that the optimal strategy 
described in Theorem |8] is in fact optimal among all interactive strategies. 

We note that the above conjecture has been verified by hand for all threshold functions of 
three variables. A formal proof of the conjecture, however, remains a challenge for the future. 

2 ) Average Case Complexity of Computing Boolean Threshold Functions: In this section, we 
quantify the average case complexity of computing Boolean threshold functions in collocated 
networks. For simplicity, we suppose that nodes' measurements are independent and identically 
distributed, and propose a natural block computation strategy that is easy to analyze. 

Theorem 9: Suppose that the nodes' measurements Xi,X2,...,X„ are independent and iden- 
tically distributed with p{Xi = I) = p. Then, the average case complexity of zero error block 
computation of the threshold function IIq{Xi,X2j. . . jX„) is 0(6) bits. 

Proof: We need to describe a coding strategy which achieves zero error block computation, 
as block length A'^ goes to infinity. Let us suppose that nodes communicate in reverse order 
starting with node n. Node n encodes its block of A'^ measurements using a Huffman code which 
requires \NH{p)~\ bits. Having heard all previous transmissions, each successive node discards 
the instances of the block that are already determined, i.e., those instances of the block that have 
already recorded 6 ones. It then constructs the Huffman code for the remaining instances of the 
block. Let the number of determined instances after node / + 1 transmits be denoted by random 
variable Z,. Then, the average complexity of computing a function block of length is given 



by 



f^{N-E{Zi))H{p) = 0NH{p)+NH{p)'Y,'^\' | - p)'"^'- (14) 
1=1 ,=e,/=o\j 



We need to somehow carefully upper bound the RHS in the (fT4l) . We start by establishing the 
following lemma. 

Q 

Lemma 6: Define ge := -f^ for Q a positive integer. Then 



Proof of Lemma: The proof is by induction on Q. For = 1, we have g\ = = g^^^ trivially. 
For 6 > 1, observe that ge-i —ge =x^^^ and hence g^^ =g^Q_i^ — (0 — 1)!. By the induction 
assumption, we have 

(e-i)_ d ( (0-2)! \ ..,_/ (0-l)! , 

which completes the induction. □ 

We now proceed to show that the second term on the RHS in (fT4l) is smaller than QNH{p) {^^-^ 
for each n. The proof is by induction on 0. For = 1, the second term is given by 



I 



Define i?g :=E"=e E;=o ^■'(1 ^ ■ Then, we have the following recursion: 

J, 



^=e\e-l 7=0 \ j 



From the induction hypothesis, we have that 




e-1 



[9-\){\-p) p^-i J( 



e-i Me-i) / 



X 



Now, applying Lemma [6] in (fT5] ). we can show 7?'^ < which establishes the induction 

step. Substituting the upper bound for the second term in the RHS of (fT4l) . we obtain that the 
total number of bits transmitted is less than ^^^^p^ for all n. This yields a sum rate of -^^^ 
which completes the proof. □ 

We make some observations regarding the above result. 

(i) For a type-threshold function [?] with threshold vector [01,02], we can run two parallel 
schemes with thresholds [0i,O] and [0, 62], thus attaining a sum rate (^1+^2)-^^^) _ Since we 
typically consider 61,62 to be constants independent of n, we obtain that the average case 
complexity of computing Boolean threshold functions is 0(1). 

(ii) As a special case, the average case complexity of computing a symmetric Boolean Disjunc- 
tive Normal Form with bounded minterms is 0(1). 

C. Computation under an alternate communication model 

In this section, we illustrate how we can adapt the solution to the general dynamic pro- 
gramming equation described in Lemma [T] to a different communication model. We return to 
the problem of computing a single instance of a Boolean threshold function n„_yt(X") in the 
broadcast scenario. Let us suppose that time is slotted, and that nodes transmit information in 
the form of pulses, which have unit energy cost. Under this alternate communication model, our 
modified objective is to minimize the expected total energy expended in transmissions. 

In contrast to Section IIV-AI where the cost of transmitting a bit is uniformly 1, under the 
pulse model of communication, silence can be used to convey information with zero cost. This 



is similar to the silence-based communication model studied in [?]. Thus, the communication 
problem is no longer trivial. However, since each node makes a Boolean measurement, the value 
of its bit can be mapped to the presence or absence of a pulse in two ways. Either node / 
transmits a pulse to indicate X, = 1 and remains silent to indicate X, = 0, or vice versa. Clearly, 
the optimal communication strategy is as follows: 

• If < 5, then node / transmits a pulse to indicate Xi= 1. 

• If /'i > 5> then node i transmits a pulse to indicate Xi = 0. 

We are still left with the problem of determining the optimal ordering of transmissions. 

Let f?(n„_fc(X")) be the minimum expected total energy required in order to compute the 
threshold function n„ under this communication model. The problem of minimizing the 

expected total energy can be formulated as a dynamic programming equation as follows 



(n„_,(X")) = mm{mm{pi, 1 - ^0 +A-^(n„„fc-i(X'lJ) + {\ ~ p,)cff{n„_k{X'L,))} (16) 



From Lemma [B we have the following result. 

Theorem 10: In order to compute a single instance of the Boolean threshold function Il,j_j^{X") 
under the pulse communication model, it is optimal for node k+l to transmit first. 
Proof: Observe that (fT6] | is a special case of (flOl) where f{p) = min(p, I— p). Hence, in order to 
establish the result, it is sufficient to show that min(p, I — p) satisfies the conditions in Lemma 
[TJ Indeed, min(p, I — p) is symmetric about p = ^ and we have. 





is a monotone non-increasing function of p. The theorem then follows directly 



from Lemma \T\ □ 



D. Approximate Function Computation 

In Sections IIV-AI through IIV-C[ we have considered the problem of computing Boolean 
threshold functions with zero error. While we have focused on constructing optimal strategies 



to minimize the expected total number of bits exchanged during computation, we must note that 
the worst-case total number of bits exchanged might still be «. In some applications however, we 
might have a constraint on the number of bits exchanged, or equivalently, the number of timeslots 
available for computation. In this case, one cannot always hope to compute the function exactly. 
Instead, we consider approximate function computation where we seek to minimize certain error 
metrics. 

To begin with, let us consider the class of Boolean threshold functions. As before, we permit 
all interactive strategies where the choice of next transmitting node can depend arbitrarily on all 
previously broadcast bits. Let us suppose that we are allowed to exchange at most {n — 0) bits 
in order to compute the threshold function n„_^(X"). We propose two metrics of error, namely 
probability of error and conditional entropy of the function. 

• Probability of error: Let Pr'iJln-kO^')) denote the minimum probability of error after at 
most (n — 9) bits are exchanged. Note that these bits are exchanged in sequential fashion, 
since we are computing in a broadcast network. Hence, the identity of each successive 
transmitting node can depend on the previously transmitted bits. The sequential nature of 
this problem permits a dynamic programming formulation analogous to (flOl) . 

P^'\n„.k{X-))=mm{p,P^'\n,,^k-i{'^l,) + {\-pi)P^^^ (17) 

with the boundary condition pi^^(ne-fc(X^)) = min(/'(ne_^(X^) = l),/'(ne-fc(X^) = 0)). 

• Conditional entropy of function: Let //(^)(n„_yt(X")) denote the minimum conditional 
entropy of the function after at most {n — 0) bits are exchanged. As before, the identity 
of each successive transmitting node can depend on the previously transmitted bits. Once 
again, the sequential nature of this problem permits a dynamic programming formulation 
analogous to (flOl) . 

//(^)(n„_,(x")) = min{p,//(^)(n„_,_i(x'i,o + (i-P/)//(^Hn„_,(x'i,0}^ (i8) 

with the boundary condition H^^\Tle_kO^^)) = H{TlQ_kO^^)). 



1) Counter-example: At fsubirst glance, one would expect that the optimal strategy for 
approximate function computation would match the strategy for exact function computation, 
thus verifying that the strategy proposed in Theorem |7] is increasingly correct. Unfortunately, 
this is not true as shown by the following counter example. 

Let us suppose that we want to compute n2(Xi,X2,X3) and we are allowed to exchange 
exactly one bit. We have exactly three choices of strategy. Either node 1 transmits first, or node 
2, or node 3. Consider the case where p\ = 0.7,^2 = 0.82, p3 = 0.84, then one can calculate the 
conditional entropy when node 1 transmits first (respectively node 2 and node 3). 

H(2)(n2(Zi,Z2,Z3)|Zi) = p,H{{l-p2){l-p3)) + {I -pi)H{p2P3) =0.4002. 
//(2)(n2(Xi,X2,X3)|X2) = P2H{{1 -pi){l~p3)) + {I -p2)H{p,p3)= 0.4991. 
//(2)(n2(Zi,Z2,X3)|Z3) = P3H{{l-pi)il-p2)) + il-p3)H{piP2)= 0.4121. 

Contrary to our expectation, it is not always optimal for node 2 to transmit first. This is also true 
for the probability of error metric. Again, consider the approximate computation of n2(Xi,X2,X3) 
where we are only allowed to exchange exactly one bit. For the case where pi = 0.6, p2 = 
0.12, p3 = 0.84, the probability of error expressions for the three strategies are given by 

;7imin((l -p2)(l -Ps), 1 - (1 -P2)(l -Ps)) 
+ {l ~ Pi)mm{p2P3A - P2P3) =0.1850, 
;72min((l-;7i)(l - p3),\ - {\ - pi){\ - P3)) 
+ {\ - P2)mm{pip3, \ - P1P3) =0.1850, 
773min((l - pi){l ~P2)A-{1 - P\){.^ - Pi)) 
+ (1 -P3)min(j3ij!72, 1 =0.1632. 

Thus, it appears that the structure of the optimal solution when we seek approximate com- 
putation given a fixed number of bits, is somewhat different from the optimal strategy for zero 
error computation. 



pr^(n2(Xi,X2,X3)|Xi) = 

P?\li2{Xx,X2,X3)\X2) = 
pP(n2(Xi,X2,X3)|X3) = 



2) Special case of the parity function: While the structure of the optimal strategy for the 
approximate computation of threshold functions remains elusive, the case of the parity function is 
solvable. In this section, we show that an intuitive greedy strategy is optimal for the approximate 
computation of the parity function. To begin with, the parity function of n Boolean variables 
labeled X\ through X„ is defined as follows: 



We consider the computation of 4>(X'') in a broadcast scenario where X, ^ Bern{pi). If we 
seek exact computation, the problem becomes trivial since each node must transmit its bit. Hence, 
we will consider approximate computation of parity under the conditional entropy metric. Let 
us suppose that nodes are allowed to exchange upto {n — Q) bits. Let //(^^(^»(X")) denote the 
minimum conditional entropy of the function after {n — 9) bits are exchanged. The dynamic 
programming equation analogous to (fTST ) is 



with the boundary condition //(^)(*(X^)) = h{P{^{X^) = 1)). One can derive the solution to 
(fT9l ) and hence deduce the optimal strategy for approximate computation of parity. 

Theorem 11: In order to minimize the conditional entropy of 4>(X") after (n — O) bits are 
exchanged, it is optimal for the node with highest binary entropy to transmit first. Subsequently, 
the node with the next highest entropy transmits, and so on until {n — 6) bits are transmitted. 
Proof: First, we note that (fTSl ) implies that the optimal strategy for approximate computation 
of ^»(X") is not data-dependent. Indeed, if node / transmits first, irrespective of whether Xj = 
or Xi= 1, we have the residual task of computing 4>(X'l-) given at most {n — 6 — I) bits. Thus, 
the optimal strategy can be specified a priori and does not depend on the particular values of 
the bits exchanged. Further, if our objective is to minimize the conditional entropy after {n — 6) 
bits, we are only interested in determining the optimal subset of nodes that must transmit, and 






(19) 



the order of transmission within this set is irrelevant. Thus, we have 

H^^\^{X"))= min H{c^{X")\Xs). 
SC{l,...,n} 

\s\ = n-e 

Let A = {ai,a2, . . . ,a„_e} be an optimal set of nodes. We claim that A consists of nodes with 
the (n — O) highest entropies among the n nodes. Suppose not. Then there exists nodes a* 
and Ui e A such that H{pa*) > H{pa). Consider the set A* := (A \ {a,}) IJI^*}- 

= H{paA^{X"_iA,a*))) = !) + (! - Pa.)Pm^-iA,a*))) = «)) 
< H{pa*PmX"^^^^^.^)) = 1) + (1 -p«0^(^(X'L(^,,*))) = 0)) 

which contradicts the assumption that A is an optimal subset. Thus, under the toptimal strategy, 
the set of transmitting nodes must be those with the highest entropies. A candidate strategy 
which achieves this is one where nodes transmit in decreasing order of their binary entropies. □ 

V. Concluding remarks 

We have addressed the problem of computing symmetric Boolean functions in a collocated 
wireless sensor network. In the worst case setting, we have derived optimal strategies for 
computing threshold functions, and order optimal strategies with optimal preconstant for interval 
functions. The approach presented can be extended in two directions. First, one can consider 
non-Boolean alphabets and functions which depend only on Alternately, one can consider 
non-Boolean functions of a Boolean alphabet. The fooling set lower bound and the strategy for 
achievability can be generalized to both these cases. 

In the average case setting, we have considered some sequential decision problems, that arise 
in the context of optimal distributed computation of Boolean functions of random data. The 



broadcast nature of the medium forces nodes to communicate sequentially, and the challenge is 
to order nodes' transmissions so as to both exploit the structure of the function and the knowledge 
of the underlying distribution. 

For single instance computation of Boolean threshold functions, we have shown the surprising 
result that the optimal strategy has an elegant structure, which depends only on the ordering of 
the marginal probabilities, but not on their exact values. The extension to the case of block 
computation is harder. However, we have derived the optimal strategy for a restricted class of 
coherent strategies, which we conjecture to be optimal in general. The proof technique presented 
allows a unified treatment of these two problems, and also allows extension to altemate pulse 
models of communication where nodes transmit pulses of energy. 

Finally, we have considered the problem of approximate function computation, where we are 
given a fixed number of bits and seek to minimize the error in the estimate of the function. We 
have shown that this problem is more complicated and the optimal strategy lacks the structure 
that we observed in the case of exact computation. However, for the special case of the parity 
function, a simple greedy strategy is optimal. There remain several open problems concerning 
optimal computation in wireless sensor networks, including the case of correlated measurements, 
and generalizing the sequential decision making approach to handle more general functions. 

Appendix 

A. Proofs of Lemma [7] 

1 ) Proof of Lemma ^ First, let us suppose k = 0. In this case 

^+iA^(x"+') = (/^i-/'0c(n„(x"4[,.p) + (i-Pi)c(n^^^ 

However, by assumption, we have < (1 —pi)f{pi) — (1 — Pi)f{pi)- 



Next, consider the case where k^O. 



{Pk+i -p,-)c(n„_,(x"_+^^i .p) + (1 -;.,+i)c(n„_,+i(x"+;^ip) - (i -p,-)c(n„_fc+i(x'it^)) 



(Pk+i -Pi) [/(p^) +p^c(n„_,_i(x"+i,^j^^.p) + (1 -p,)c(n,_,(x"+i,^j^^.p) 

+(1 -Pk+i) [fiPk) +PkC{u„_k{^'iYk^k+i))) + (1 -p^)c(n„_,+i(x': 
-(1 -Pi) [fiPk)+PkCin„_k{^'i+u^)) + (1 -p,)c(n„_,+i(x"+,i .p) 



(20) 



Pk 



{Pk+i-Pi)c{n,-k-i{x'l+lk+^,~^)) 



+(1 -p,+i)c(n„_,(x"_+i,^iP) - (1 -;,,)c(n„_fc(x"_+i .))) 



{pk+i-P,)c{nn-k{^ 



n+l 
{k,k+l4) 



)) 



(21) 



+(1 -p,+i)c(n„_fc+i(x'i+/,^ip) - (1 -;,oc(n„_,+i(x':+i^.p) 



Pk 



(w+i-p,Oc(n,_,_i(x':+i,^i^^.p) 



+(1 -;.fc+i)c(n„_,(x"+i,^ip) - (1 - p,)c{Un-k{x'!_+l.^))\ + (1 - w)si;^i,,_i(x"_+i) 
(p,+i-p,-)c(n„_,_i(x"+i,^j^,.p) 

+(1 - pk+i)c{n„_kix"_+l,^^^)) - (1 -;.,)c(n„_,(x':+,i,.p) 



+(1 -Pk) [(1 - (1 -pdfiPk+L 



(22) 



(p^+i -p/)c(n„_,_i(x':+i,^j^^.p) + (1 -;.,+i)c(n„_,(x"+i,^jp) 

-(1 -P,)[nPk+i +w+iC(n„_n(x'i+;,^i .))) + (1 -7,fc+i)c(n„_,(x'i+;,_^^ .)))] 
+(1 -Pk) [(1 - (1 -/'O/b^+i)] 



(23) 



Pk{i -Pk+i) c(n„_,(x':+;,^jp) - AC(n„_,_i(x':+;,^j^^.p) - (i -;,,)c(n„_,(x"+;,^j .p) 



-p;t(i - + (1 -Pk) [(1 - (1 -Pi)npk+i)] 

Pk{i -pk+i)f{pi)-{i-Pi)f{pk+i) + {'^ -pk){\-Pk+\)f{Pi) 
(1 -Pk+i)f{pi) - (1 -Pi)f{pk+i) 



(24) 



Equation (|20l ) follows from the optimal ordering for computing n„_^(X";^^^^j .^), n„_yt+i 

and n„_yt_|_i (X"|'), which is true by the induction hypothesis for m = n. The inequality (|22|) 



follows from the induction hypothesis that sl^l_^ -(X"!/) < (1 — Pk+i)f{Pi) — (1 — Pi)f{Pk+i)- 
Equality in (l23l) and (l24l) follows from the optimal ordering for computing n„_;t(X'l^^ .-j) and 



) respectively. □ 
2) Proof of Lemma \3\- First, let us suppose k = n. In this case 



^+i,,v(x"+^) = (A-Pn+i)c(no(x':+i,^^p)+p,+iC(no(x'i+;^iP)-p^^^ 



n+l 



However, by assumption, we have < Pn+ifiPi) — Pif{Pn+\, 
Next, consider the case where k < n. 



{p,-Pk+i)c{n„_k{x'i+lj^^^^)) +;.,+iC(n,_,(x"+i^ip) -p,c(n„_,(x'i+i)) 



(p-Pk+i) l/(m2)+P^+2C(n„_,_i(x"+i,^i,^2))) + i^-Pk+2)c{n,-k{x'i\lk+i,k+2)^) 



n+l 



+Pk+l 
-Pi 

Pk+2 



f{Pk+2) + w+2C(n„_,_i(x"+i^i ,^2))) + (1 - w+2)c(n„_,(x"+i^i^,^2))) 
fiPk+i) +Pk+2C{u„^k-i{x'l\l^^^-^)) + (1 -77,+2)c(n„_,(x'!+i,^2))) 
(A--;^.+i)c(n„_,_i(x'L|i,^ )) 



(25) 



h2)^ 

+Pk+lC{Tln-k-l{X"^^l^^j^^2))) - PiC(n„_A:-l (X"|.|^^2))) 



-(fc+l,^:+2)^ 

+(1 -Pk+2) (;^/-p^+i)c(n„_,(x'i+i,^j ,^2))) +p^+iC(n„_,(x':+i^j ,^2))) 

+P.+iC(n„_,(x"+,i^j,^2)))-A-c(n„_,(x"+i,^2)))^ 

72+1 



< (1-^2) [(A-;?^+i)C(n„,,(X'l+;,^j ,^2))) 

+ W+lC(n„_fc(X"|^^^j^^^2))) -P'C(nn-fc(X""(^^^^2))) +i^fc+2 

< i^-Pk+2) [(A-p^+i)c(n„_,(x"+i,^j,^2))) 

+7^.+iC(n„_,(x"4i^j,^2)))-A-c(n.-.(x"_+i,^2)))^ 

+P/t+2 [Pk+lf{Pi) - Pif{Pk+l)] 



r'(2) ^ 
^«,yt,A^-(yt+2)^ 



(26) 



+Pk+2 [Pk+\f{Pi) -Pif{Pk+\)] 
= (1 -Pk+2)Pk+\ C(n;7-fc(X"|^^^j^^,^2))) ~P'^(n"-^-l(^"'('>+l,/t+2))) 

-(1 -Pk+2)Pif{Pk+\) +Pk+2 [Pk+\f{Pi) - Pif{Pk+\)] 
< (1 -Pk+2)Pk+\f{Pi) - Pif{Pk+l) + Pk+2Pk+lf{Pi) 
= Pk+\f{Pi)-Pif{Pk+\) 



(27) 



(28) 



Equation (l25l) follows from the optimal ordering for computing n„_yt(X""|.^^^j-|), n„_yt(X"^^^j^^ 
and n„„^(X"^^), which follows from the induction hypothesis for m = n. The inequality (|26l ) fol- 
lows from the induction hypothesis that ^'i^i ;(^-^a^+2)) — Pk+ifiPi) ~ Pif{Pk+i)- Equations (l27l) 



and (l28l) follow from the optimal ordering for computing n„_;t(X"|r^^2)) and n„_fe(X ^^^^^ ^,^2). 
respectively. □ 

3) Proof of Lemma First, we observe that 



-iPk+i-Pdcin,~kix'l+l^,.^)). 



Thus it is enough to show that 



p,+iC(n„_,(x"_+i^jP)-p,c(n„_,(x'i+i)) 



< {pk+i -pi)C{U„_k{X"_+l^^ .^)) +Pk+if{pi) -Pif{Pk+i) for i>k + 2. 



First, observe that for k = n, the statement is vacuously true since ?' > n + 2 is impossible. Hence, 
let us suppose that k < n. We have 

p,+iC(n,_,(x«+i iO)-p,c(n„_,(x'i+i)) 



Pk+\ 
-Pi 
Pk+i 



/(w+2) +P^+2C(n„_,_i(x'i+i^i ,^2))) + (1 - w+2)c(n„_,(x"+i^j ,^2))) 
nPk+i+Pk+iC{n,-k-i{x"+l^,.^)) + (1 -p,+i)c(n„_,(x'i+i^i^^.p)' 

fiPk+i) +m2C(n,_,_i(x':+,i^j ,^2))) -P/C(n,_,_i(x"+,i^j^,.p) 



(29) 



+Pk+i{i -p,+2)c(n„_,(x"+i^j ,^2))) _p,^j)c(n„_,(x"+i^j .))) - 



/(;.,) + (1 -;.,)c(n„_,(x"+;^j^^.p) - (1 -;?^+2)c(n„_,(xi+i^^ ,_^2))) 



+P/t+i(i-pyt+2)c(n„_fc(x 



n+l 
-{k+l,k+2) 



)) - -PA.+i)c(n„_,(x"+i^j -Pif{Pk+i) (30) 



= (p^+i -;?,)C(n„_fc(X«+;^j .))) +Pk+if{Pi) -Pif{Pk+i) 

Equation [29l follows from the optimal order for computing n„_fc(X"|^^jP and n„_;t(X"^^). The 
inequality in [30l follows from the induction hypothesis r„ ^. ,(X'^|^^^jP < f{pi) — fiPk+i)- 
4) Proof of Lemma \5\- First, we observe that 



7;+i,.,(X"+i) -52i^,^,(X"+i) = (1 ~ Pk+i)C{Un^k+i{X'l\l^,))) - (1 -7.,)C(n„_,+i(X'lf)) 

-(p-Pfc+i)c(n„_,(x"+i,^jP). 

Thus it is enough to show that 



(1 -;.fc+i)c(n„_,+i(x"_+,i^jp) - (1 -p,)c(n„_,+i(x'i+i)) 



< (p/-p,+i)C(n„_,(X'l+i,^jp) + (l-p,+ i)/(p,-)-(l-A-)/(w+i) for 



First, observe that for k = 0, the statement is vacuously true since ?' < is impossible. Hence, 
let us suppose that k> 0. We have 



(1 -p,+i)c(n„_,+i(x':+Vip) - (1 -p,)c(n,_,+i(x'i+^)) 

(1 -pk+i) [f{pk)+PkC{n„^k{x"_+l,^,^)) + (1 -;.,)c(n„_,+i(x"+i,^ip)' 



-(1 -Pi) [/(w+i) + w+ic(n„_,(x"+i,^ip) + (1 -p,+i)c(n„_,+i(x'i-[,;,^ip) 
(1 -Pk+i) [fiPk) + (1 -p^)c(n,_,+i(x"+i,^jp) - (1 -p,)c(n„_,+i(x'!+i,^jp) 



Tll+l 



(31) 



-p^+i)c(n„_,(x"+i,^jp) -p,)c(n„_,(x"+i,^jp) - (i - 



'{k,k+l) 
rn+l 



< (1 -p^+i) [f{p.) +;^,c(n„_,(x"+i,_^jp) -;,,c(n„_,(x'i+i,^, J) 



(32) 



-p,+i)c(n,_,(x"+i,^iP) + (1 -Pk+OfiPi) - (1 -P/)/(;?fc+i; 



Equation (|3T1) follows from the optimal order for computing n„_y(.+i(X"^^^jP and n„_fc+i (X"^^). 
The inequality in (l32l) follows from the induction hypothesis 7]j,fc-i,;(X"^^^jP <f{Pi) —f{Pk) 



□ . 



